Search CORE

71 research outputs found

Transcript expression-aware annotation improves rare variant interpretation

Author: Cummings Beryl B.
Daly Mark J.
Färkkilä Martti
Genome Aggregation Database Consor
Genome Aggregation Database Prod T
Holi Matti M.
Kallela Mikko
Kaprio Jaakko
Karczewski Konrad J.
Karjalainen Juha
Kosmicki Jack A.
MacArthur Daniel G.
Palotie Aarno
Ripatti Samuli
Tuomi Tiinamaija
Wessman Maija
Publication venue
Publication date: 28/05/2020
Field of study

The acceleration of DNA sequencing in samples from patients and population studies has resulted in extensive catalogues of human genetic variation, but the interpretation of rare genetic variants remains problematic. A notable example of this challenge is the existence of disruptive variants in dosage-sensitive disease genes, even in apparently healthy individuals. Here, by manual curation of putative loss-of-function (pLoF) variants in haploinsufficient disease genes in the Genome Aggregation Database (gnomAD)(1), we show that one explanation for this paradox involves alternative splicing of mRNA, which allows exons of a gene to be expressed at varying levels across different cell types. Currently, no existing annotation tool systematically incorporates information about exon expression into the interpretation of variants. We develop a transcript-level annotation metric known as the 'proportion expressed across transcripts', which quantifies isoform expression for variants. We calculate this metric using 11,706 tissue samples from the Genotype Tissue Expression (GTEx) project(2) and show that it can differentiate between weakly and highly evolutionarily conserved exons, a proxy for functional importance. We demonstrate that expression-based annotation selectively filters 22.8% of falsely annotated pLoF variants found in haploinsufficient disease genes in gnomAD, while removing less than 4% of high-confidence pathogenic variants in the same genes. Finally, we apply our expression filter to the analysis of de novo variants in patients with autism spectrum disorder and intellectual disability or developmental disorders to show that pLoF variants in weakly expressed regions have similar effect sizes to those of synonymous variants, whereas pLoF variants in highly expressed exons are most strongly enriched among cases. Our annotation is fast, flexible and generalizable, making it possible for any variant file to be annotated with any isoform expression dataset, and will be valuable for the genetic diagnosis of rare diseases, the analysis of rare variant burden in complex disorders, and the curation and prioritization of variants in recall-by-genotype studies.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders

Author: Biskup Saskia
Daly Mark J.
De Jonghe Peter
Du Juliana
EuroEPINOMICS-RES Consortium
Gormley Padhraig
Guerrini Renzo
Helbig Ingo
Helbig Katherine L.
Koeleman Bobby P.C.
Kosmicki Jack A.
Krause Roland
Kurki Mitja
Lal Dennis
Majitha Amit R.
Marini Carla
May Patrick
Møller Rikke S.
Neubauer Bernd A.
Niestroj Lisa M.
Nürnberg Peter
Palotie Aarno
Perez-Palma Eduardo
Poduri Annapurna
Robinson Elise B.
Samocha Kaitlin E.
Tang Sha
Ware James S.
Weber Yvonne G.
Weckhuysen Sarah
Wu Sitao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Abstract Background Classifying pathogenicity of missense variants represents a major challenge in clinical practice during the diagnoses of rare and genetic heterogeneous neurodevelopmental disorders (NDDs). While orthologous gene conservation is commonly employed in variant annotation, approximately 80% of known disease-associated genes belong to gene families. The use of gene family information for disease gene discovery and variant interpretation has not yet been investigated on a genome-wide scale. We empirically evaluate whether paralog-conserved or non-conserved sites in human gene families are important in NDDs. Methods Gene family information was collected from Ensembl. Paralog-conserved sites were defined based on paralog sequence alignments; 10,068 NDD patients and 2078 controls were statistically evaluated for de novo variant burden in gene families. Results We demonstrate that disease-associated missense variants are enriched at paralog-conserved sites across all disease groups and inheritance models tested. We developed a gene family de novo enrichment framework that identified 43 exome-wide enriched gene families including 98 de novo variant carrying genes in NDD patients of which 28 represent novel candidate genes for NDD which are brain expressed and under evolutionary constraint. Conclusion This study represents the first method to incorporate gene family information into a statistical framework to interpret variant data for NDDs and to discover new NDD-associated genes

Kölner UniversitätsPublikationsServer

Institutional Repository Universiteit Antwerpen

Helsingin yliopiston digitaalinen arkisto

Open Repository and Bibliography - Luxembourg

Genetic risk for autism spectrum disorders and neuropsychiatric variation in the general population

Almost all genetic risk factors for autism spectrum disorders (ASDs) can be found in the general population, but the effects of that risk are unclear in people not ascertained for neuropsychiatric symptoms. Using several large ASD consortia and population based resources, we find genetic links between ASDs and typical variation in social behavior and adaptive functioning. This finding is evidenced through both inherited and de novo variation, indicating that multiple types of genetic risk for ASDs influence a continuum of behavioral and developmental traits, the severe tail of which can result in an ASD or other neuropsychiatric disorder diagnosis. A continuum model should inform the design and interpretation of studies of neuropsychiatric disease biology

Crossref

Harvard University - DASH

Copenhagen University Research Information System

PubMed Central

Birkbeck Institutional Research Online

MPG.PuRe

Explore Bristol Research

University of Queensland eSpace

Recommended from our members

A framework for the interpretation of de novo mutation in human disease

Author: Boerwinkle Eric
Buxbaum Joseph D.
Cook Edwin H.
Daly Mark J.
dePristo Mark
Devlin Bernie
Gabriel Stacey B.
Gibbs Richard A.
Kirby Andrew
Kosmicki Jack A.
MacArthur Daniel G.
Mallick Swapan
McGrath Lauren M.
Neale Benjamin M.
Palotie Aarno
Purcell Shaun M.
Rehnström Karola
Robinson Elise B.
Roeder Kathryn
Sabo Aniko
Samocha Kaitlin E.
Sanders Stephan J.
Schellenberg Gerard D.
Stevens Christine
Sutcliffe James S.
Wall Dennis P.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2015
Field of study

Spontaneously arising (‘de novo’) mutations play an important role in medical genetics. For diseases with extensive locus heterogeneity – such as autism spectrum disorders (ASDs) – the signal from de novo mutations (DNMs) is distributed across many genes, making it difficult to distinguish disease-relevant mutations from background variation. We provide a statistical framework for the analysis of DNM excesses per gene and gene set by calibrating a model of de novo mutation. We applied this framework to DNMs collected from 1,078 ASD trios and – while affirming a significant role for loss-of-function (LoF) mutations – found no excess of de novo LoF mutations in cases with IQ above 100, suggesting that the role of DNMs in ASD may reside in fundamental neurodevelopmental processes. We also used our model to identify ~1,000 genes that are significantly lacking functional coding variation in non-ASD samples and are enriched for de novo LoF mutations identified in ASD cases

Harvard University - DASH

Whole-genome sequencing reveals host factors underlying critical COVID-19

Author: 23andMe
Abecasis Goncalo R
Arumugam Prabhu
Baillie J Kenneth
Baras Aris
Bentley David
Bretherick Andrew D
Caulfield Mark J
Chan Georgia
Covid-19 Human Genetics Initiative
Donovan Sally
Elgar Greg
Elliott Katherine S
Elliott Paul
Fawkes Angie
Ferreira Manuel AR
Fowler Tom A
GenOMICC Investigators
Goddard Peter
Griffiths Fiona
Hendry Sara Clohisey
Hinds Charles
Horby Peter
Horowitz Julie E
Justice Anne
Keating Sean
Kingsley Clare
Klaric Lucija
Knight Julian
Kosmicki Jack A
Kousathanas Athanasios
Law Andy
Ling Lowell
Maleady-Crowe Fiona
Malinauskas Tomas
Maslove David
McAuley Danny
Millar Jonathan
Mirshahi Tooraj
Montgomery Hugh
Morrice Kirstie
Moutsianas Loukas
Murphy Lee
Nichol Alistair
Odhams Christopher A
Oetjens Matthew
Oosthuyzen Wilna
Openshaw Peter JM
Pairo-Castineira Erola
Parkinson Nick
Patch Christine
Ponting Chris P
Rader Daniel J
Rawlik Konrad
Rendon Augusto
Rhodes Daniel
Ritchie Marylyn D
Rowan Kathy
Russell Clark D
Salehi Shahla
Scott Richard H
Semple Malcolm G
Shankar-Hari Manu
Shen Xia
Siddiq Afshan
Stuckey Alex
Summers Charlotte
Tenesa Albert
Todd Linda
Verma Anurag
Vitart Veronique
Walker Susan
Walsh Timothy
Wang Bo
Wilson James F
Wu Yang
Yang Jian
Zainy Tala
Zechner Marie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/07/2022
Field of study

Critical Covid-19 is caused by immune-mediated inflammatory lung injury. Host genetic variation influences the development of illness requiring critical care1 or hospitalisation2-4 following SARS-CoV-2 infection. The GenOMICC (Genetics of Mortality in Critical Care) study enables the comparison of genomes from critically-ill cases with population controls in order to find underlying disease mechanisms. Here, we use whole genome sequencing in 7,491 critically-ill cases compared with 48,400 controls to discover and replicate 23 independent variants that significantly predispose to critical Covid-19. We identify 16 new independent associations, including variants within genes involved in interferon signalling (IL10RB, PLSCR1), leucocyte differentiation (BCL11A), and blood type antigen secretor status (FUT2). Using transcriptome-wide association and colocalisation to infer the effect of gene expression on disease severity, we find evidence implicating multiple genes, including reduced expression of a membrane flippase (ATP11A), and increased mucin expression (MUC1), in critical disease. Mendelian randomisation provides evidence in support of causal roles for myeloid cell adhesion molecules (SELE, ICAM5, CD209) and coagulation factor F8, all of which are potentially druggable targets. Our results are broadly consistent with a multi-component model of Covid-19 pathophysiology, in which at least two distinct mechanisms can predispose to life-threatening disease: failure to control viral replication, or an enhanced tendency towards pulmonary inflammation and intravascular coagulation. We show that comparison between critically-ill cases and population controls is highly efficient for detection of therapeutically-relevant mechanisms of disease

UCL Discovery

De novo Variants in Neurodevelopmental Disorders with Epilepsy

Author: Abou Jamra Rami
Caglayan Hande
Craiu Dana
Daly Mark J
De Jonghe Peter
EuroEPINOMICS-RES Consortium
Guerrini Renzo
Helbig Ingo
Helbig Katherine L
Heyne Henrike O
Koeleman Bobby P C
Kosmicki Jack A
Lal Dennis
Lemke Johannes R
Linnankivi Tarja
May Patrick
Muhle Hiltrud
Møller Rikke S
Neubauer Bernd A
Palotie Aarno
Pendziwiat Manuela
Poduri Annapurna
Singh Tarjinder
Sisodiya Sanjay M
Stamberger Hannah
Striano Pasquale
Tang Sha
Weber Yvonne G
Weckhuysen Sarah
Wu Sitao
Publication venue: Cold Spring Harbor Labs Journals
Publication date: 01/01/2018
Field of study

Neurodevelopmental disorders (NDD) with epilepsy constitute a complex and heterogeneous phenotypic spectrum of largely unclear genetic architecture. We conducted exome-wide enrichment analyses for protein-altering de novo variants (DNV) in 7088 parent-offspring trios with NDD of which 2151 were comorbid with epilepsy. In this cohort, the genetic spectrum of epileptic encephalopathy (EE) and nonspecific NDD with epilepsy were markedly similar. We identified 33 genes significantly enriched for DNV in NDD with epilepsy, of which 27.3 were associated with therapeutic consequences. These 33 DNV-enriched genes were more often associated with synaptic transmission but less with chromatin modification when compared to NDD without epilepsy. On average, only 53 of the DNV-enriched genes were represented on available diagnostic sequencing panels, so our findings should drive significant improvements of genetic testing approaches

UCL Discovery

Institutional Repository Universiteit Antwerpen

Open Repository and Bibliography - Luxembourg

Archive ouverte UNIGE

Analysis of protein-coding genetic variation in 60,706 humans

Author: Abboud
Abecasis
Aguilar-Salinas
Altshuler David M.
Ardissino Diego
Arellano-Campos
Atzmon
Aukrust
Banks Eric
Barr
Bell
Bergen
Berghout Joanne
Birnbaum Daniel P.
Bjørkhaug
Blangero
Boehnke Michael
Bowden
Budman
Burtt
Centeno-Cruz
Chambers
Chambert
Clarke
Collins
Cooper David N.
Coppola
Cortes
Cox
Cummings Beryl B.
Córdova
Daly Mark J.
Danesh John
Deflaux Nicole
DePristo Mark
Do Ron
Donnelly Stacey
Duggirala
Duncan Laramie E.
Elosua Roberto
Estrada Karol
Farrall
Fennell Timothy
Fernandez-Lopez
Flannick Jason
Florez Jose C.
Fontanillas
Frayling
Freimer
Fromer Menachem
Fuchsberger
Gabriel Stacey B.
García-Ortiz
Gauthier Laura
Getz Gad
Glatt Stephen J.
Goel
Goldstein Jackie
González-Villalpando
González-Villalpando
Grados
Groop
Gupta Namrata
Gómez-Vázquez
Haiman
Hanis
Hattersley
Henderson
Hill Andrew J.
Hopewell
Howrigan Daniel
Huerta-Chagoya
Hultman Christina M.
Islas-Andrade
Jacobs
Jalilzadeh
Jenkinson
Jiménez-Morale
Karczewski Konrad J.
Kathiresan Sekar
Kiezun Adam
King
Kirov
Kooner
Kosmicki Jack A.
Kurki Mitja I.
Kyriakou
Kähler
Laakso Markku
Lee
Lehman
Lek Monkol
Lyon
MacArthur Daniel G.
MacMahon
Magnusson
Mahajan
Marrugat
Martínez-Hernández
Mathews
McCarroll Steven
McCarthy Mark I.
McGovern Dermot
McPherson Ruth
McVean
Meigs
Meitinger
Mendoza-Caamal
Mercader
Minikel Eric V.
Mohlke
Moonshine Ami Levy
Moran
Moreno-Macías
Morris
Najmi
Natarajan Pradeep
Neale Benjamin M.
Njølstad
O'Donnell-Luria Anne H.
O'Donovan
Ordóñez-Sánchez
Orozco Lorena
Owen
Palotie Aarno
Park
Pauls
Peloso Gina M.
Pierce-Hoffman Emma
Poplin Ryan
Posthuma
Purcell Shaun M.
Revilla-Monsalve
Riba
Ripke
Rivas Manuel A.
Rodríguez-Guillén
Rodríguez-Torres
Rose Samuel A.
Ruano-Rubio Valentin
Ruderfer Douglas M.
Saleheen Danish
Samocha Kaitlin E.
Sandor
Scharf Jeremiah M.
Seielstad
Shakir Khalid
Sklar Pamela
Sladek
Soberón
Spector
Stenson Peter D.
Stevens Christine
Sullivan Patrick F.
Tai
Teslovich
Thomas Brett P.
Tiao Grace
Tsuang Ming T.
Tukiainen Taru
Tuomilehto Jaakko
Tusie-Luna Maria T.
Walford
Ware James S.
Watkins Hugh C.
Weisburd Ben
Wilkens
Williams
Wilson James G.
Won Hong-Hee
Yu Dongmei
Zhao Fengmei
Zou James
Publication venue
Publication date: 01/01/2016
Field of study

Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. We describe the aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of truncating variants with 72% having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human “knockout” variants in protein-coding genes

Carolina Digital Repository

Exome-wide association study to identify rare variants influencing COVID-19 outcomes: Results from the Host Genetics Initiative

Author: Adam Butterworth
Akinori Kimura
Albandari Binowayn
Alberto Gómez-Carballa
Alessandra Renieri
Alex Stuckey
Alexander W. Charney
Alexandre Bolze
Alexis Stephens
Alfredo Gonzalez
Amal Almutairi
Amy D. Stockwell
Andrea Ganna
Antonio Salas
Anurag Verma
Axel Schmidt
Bartlomiej Przychodzen
Bogdan Pasaniuc
Brian Yaspan
Carlo Rivolta
Chadi Saad
Chureerat Phokaew
COVID-19 Host Genetics Initiative
Daniel H. Geschwind
Daniel M. Jordan
David B. Goldstein
David Langlais
David Morrison
DeCOI Host Genetics Group
Diane Del Valle
Ebtehal A. Alsolm
Efren Sandoval
Elifnaz Çelik
Elizabeth T. Cirulli
Elżbieta Kaja
Eric E. Schadt
Esther Cheng
Eva C. Schulte
Fabian Brand
Fang Cai
Fatima Alqubaishi
Fawz S. Al Harthi
Federico Martinón-Torres
Francesca Fava
Francesca Mari
Francisco Tanudjaja
GEN-COVID consortium (Spain)
GEN-COVID Multicenter Study (Italy)
GenOMICC Consortium
Guillaume Bourque
Guillaume Butler-Laporte
Gundula Povysil
Hadeel El Bardisy
Hamdi Mbarek
Ho Namkoong
Hugo Zeberg
Ilaria Meloni
Irene Rivero-Calle
Iva Neveux
J Brent Richards
J Kenneth Baillie
Jack A. Kosmicki
Jacobo Pardo-Seco
Japan COVID-19 Task Force
Jessica Nordlund
Joseph D. Buxbaum
Joseph J. Grzymski
Junghyun Jung
Karolina Chwialkowska
Katsushi Tokunaga
Kelly M. Schiabor Barrett
Kerstin U. Ludwig
Klaudia Walter
Koichi Fukunaga
Konrad J. Karczewski
Kousik Kundu
Krzysztof Kiryluk
Kumar Veerapen
Laura G. Sloofman
Maciej Dabrowski
Magdalena Niemira
Malak S. Abedalthagafi
Manal Alaamery
Manish J. Butte
Mansour S. Almutairi
Manuel A. R. Ferreira
Marcin Moniuszko
Mark Lathrop
Masaya Sugiyama
Mateusz Sypniewski
Mathieu Bourgey
Mathieu Quinodoz
Michael E. Broudy
Michael Hultström
Miklos Lipcsey
Miriam Merad
Miroslaw Kwasniewski
Mohammad Fawzy
Monnat Pongpanich
Mount Sinai Clinical Intelligence Center
Nattiya Hirankarn
Nicolas Casadei
Nicole L. Washington
Nicole Simons
Nicole Soranzo
Nikhil Chavan
Ning Shang
Noam D. Beckmann
Nora Aljawini
Olaf Riess
Pajaree Chariyavilaskul
Paul C. Boutros
Paul J. Tung
Pawel Olszewski
Pawel Zawadzki
Pierre-Yves Bochud
Regeneron Genetics Center
Robert Frithiof
Robert JM Eveleigh
Robert Sebra
Ruth Johnson
Ryan C. Thompson
Ryuya Edahiro
Sacha Gnjatic
Said I. Ismail
Salam Massadeh
Saleh A. Alqahtani
Sandra Smieszek
Sarah Alotaibi
Satoru Miyano
Seishi Ogawa
Seiya Imoto
Serghei Mangul
Sergio Daga
Seunghee Kim-Schulze
Shaun Dabe
Shu Tao
Simon White
Simone Furini
Stefan Eng
Stephan Ossowski
Stephanie Arteaga
Stephanie Bibert
Stephen Riffle
Susanne Motameny
Szymon Pula
Takafumi N. Yamaguchi
Takanori Hasegawa
Takanori Kanai
Tatsuhiko Naito
Tess D. Pottinger
Theodore Drivas
Timothy Chang
Timothy Sanders
Tomoko Nakanishi
Urszula Korotko
Vincent Mooser
Vincenzo Forgetta
Voraphoj Nilaratanakul
Vorasuk Shotelersuk
Wanna Chetruengchai
William Lee
Xabier Bello
Yanara Marincevic-Zuniga
Yaseen M. Arabi
Yosuke Kawai
Yu Pan
Yukinori Okada
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2022
Field of study

Archivio della Ricerca - Università degli Studi di Siena